Smart Paper Notes

Modeling Quality and Machine Learning Pipelines through Extended Feature Models

Giordano d'Aloisio , Antinisca Di Marco , Giovanni Stilo

Category: Machine Learning

2022-07-15

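The recent increase in the complexity of Machine Learning (ML) methods has made it necessary to lighten both research and industrial development processes. ML pipelines have become an essential tool for domain experts, data scientists and researchers, allowing them to easily combine several ML models to cover the full analytic process starting from raw datasets. Over the years, several solutions have been proposed to automate the construction of ML pipelines, most of them focused on semantic aspects and characteristics of the input dataset. However, an approach that takes into account the new quality concerns required of ML systems (such as fairness, interpretability and privacy) is still missing. In this paper, we first identify the key quality attributes of ML systems from the literature. We then propose a new engineering approach for quality ML pipelines by properly extending the Feature Models meta-model. The proposed approach makes it possible to model ML pipelines, their quality requirements (on the whole pipeline and on single phases), and the quality characteristics of the algorithms used to implement each pipeline phase. Finally, we demonstrate the expressiveness of our model on the classification problem.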

Translated by Google Translate

Many-valued Argumentation, Conditionals and a Probabilistic Semantics for Gradual Argumentation

Mario Alviano , Laura Giordano , Daniele Theseider Dupré

Category: Artificial Intelligence

2022-12-14

In this paper we propose a general approach to defining a many-valued preferential interpretation of gradual argumentation semantics. The approach allows for conditional reasoning over arguments and Boolean combinations of arguments, with respect to a class of gradual semantics, through the verification of graded (strict or defeasible) implications over a preferential interpretation. As a proof of concept, in the finitely-valued case, an Answer Set Programming approach is proposed for conditional reasoning in a many-valued argumentation semantics of weighted argumentation graphs. The paper also develops and discusses a probabilistic semantics for gradual argumentation, which builds on the many-valued conditional semantics.


FLAIR #1: semantic segmentation and domain adaptation dataset

Anatol Garioud , Stéphane Peillet , Eva Bookjans , Sébastien Giordano , Boris Wattrelos

Category: Computer Vision

2022-11-23

The French National Institute of Geographical and Forest Information (IGN) has the mission to document and measure land cover on French territory and provides reference geographical datasets, including high-resolution aerial images and topographic maps. The monitoring of land cover plays a crucial role in land management and planning initiatives, which can have significant socio-economic and environmental impact. Together with remote sensing technologies, artificial intelligence (AI) promises to become a powerful tool in determining land cover and its evolution. IGN is currently exploring the potential of AI in the production of high-resolution land cover maps. Notably, deep learning methods are employed to obtain a semantic segmentation of aerial images. However, territories as large as France imply heterogeneous contexts: variations in landscapes and image acquisition make it challenging to provide uniform, reliable and accurate results across all of France. The FLAIR-one dataset presented is part of the dataset currently used at IGN to establish the French national reference land cover map "Occupation du sol à grande échelle" (OCS-GE).


Characterizing and Detecting State-Sponsored Troll Activity on Social Media

Fatima Ezzeddine , Luca Luceri , Omran Ayoub , Ihab Sbeity , Gianluca Nogara , Emilio Ferrara , Silvia Giordano

Category: Machine Learning

2022-10-17

The detection of state-sponsored trolls acting in information operations is an unsolved and critical challenge for the research community, with repercussions that go beyond the online realm. In this paper, we propose a novel AI-based solution for the detection of state-sponsored troll accounts, which consists of two steps. The first step aims at classifying trajectories of accounts' online activities as belonging to either a state-sponsored troll or an organic user account. In the second step, we exploit the classified trajectories to compute a metric, namely the "troll score", which allows us to quantify the extent to which an account behaves like a state-sponsored troll. As a case study, we consider the troll accounts involved in the Russian interference campaign during the 2016 US Presidential election, identified as Russian trolls by the US Congress. Experimental results show that our approach identifies accounts' trajectories with an AUC close to 99% and, accordingly, classifies Russian trolls and organic users with an AUC of 97%. Finally, we evaluate whether the proposed solution can be generalized to different contexts (e.g., discussions about Covid-19) and generic misbehaving users, showing promising results that will be further expanded in our future endeavors.
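The two-step idea in this abstract can be illustrated with a minimal sketch. The abstract does not give the formula for the troll score, so the code below assumes, purely for illustration, that the score is the fraction of an account's trajectories that the first-step classifier labelled as troll-like. The function names `troll_score` and `is_troll` and the 0.5 threshold are hypothetical and not taken from the paper.

```python
from typing import Sequence


def troll_score(trajectory_labels: Sequence[int]) -> float:
    """Assumed score: share of trajectories labelled troll-like (1) vs organic (0)."""
    if not trajectory_labels:
        raise ValueError("an account needs at least one classified trajectory")
    return sum(trajectory_labels) / len(trajectory_labels)


def is_troll(trajectory_labels: Sequence[int], threshold: float = 0.5) -> bool:
    """Flag an account whose assumed troll score reaches a hypothetical threshold."""
    return troll_score(trajectory_labels) >= threshold


# Example: three of four trajectories were classified as troll-like.
labels = [1, 1, 0, 1]
print(troll_score(labels), is_troll(labels))
```

Aggregating per-trajectory decisions into one per-account number is what turns a trajectory classifier into an account-level detector; the actual aggregation used by the authors may differ from this simple average.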


Neural Transformers for Intraductal Papillary Mucosal Neoplasms (IPMN) Classification in MRI images

Federica Proietto Salanitri , Giovanni Bellitto , Simone Palazzo , Ismail Irmakci , Michael B. Wallace , Candice W. Bolan , Megan Engels , Sanne Hoogenboom , Marco Aldinucci , Ulas Bagci

Category: Computer Vision | Artificial Intelligence

2022-06-21

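Early detection of precancerous cysts or neoplasms in the pancreas, i.e., Intraductal Papillary Mucosal Neoplasms (IPMN), is a challenging and complex task, and it may lead to a more favourable outcome. Once detected, IPMNs must also be graded accurately, since low-risk IPMNs can be placed under a surveillance program, while high-risk IPMNs have to be surgically resected before they turn into cancer. Current standards for IPMN classification (Fukuoka and others) show significant intra- and inter-operator variability, besides being error-prone, which makes a proper diagnosis unreliable. The established progress in artificial intelligence through the deep learning paradigm may provide a key tool for effectively supporting medical decisions for pancreatic cancer. In this work, we follow this trend by proposing a novel AI-based IPMN classifier that leverages the recent success of transformer networks in generalizing across a wide variety of tasks, including vision tasks. We specifically show that our transformer-based model exploits pre-training better than standard convolutional neural networks, thus supporting the sought architectural universality of transformers in vision, including the medical image domain, and that it allows for a better interpretation of the obtained results.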


On the inability of Gaussian process regression to optimally learn compositional functions

Matteo Giordano , Kolyan Ray , Johannes Schmidt-Hieber

Category: Machine Learning (Statistics) | Machine Learning

2022-05-16

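We rigorously prove that deep Gaussian process priors can outperform Gaussian process priors if the target function has a compositional structure. To this end, we study information-theoretic lower bounds for posterior contraction rates for Gaussian process regression in a continuous regression model. We show that if the true function is a generalized additive function, then the posterior based on any mean-zero Gaussian process can only recover the truth at a rate that is strictly slower than the minimax rate by a factor that is polynomially suboptimal in the sample size $n$.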


The emergence of a concept in shallow neural networks

Elena Agliari , Francesco Alemanno , Adriano Barra , Giordano De Marzo

Category: Machine Learning (Statistics)

2021-09-01

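We consider restricted Boltzmann machines (RBMs) trained over an unstructured dataset made of blurred copies of definite but unavailable "archetypes", and we show that there exists a critical sample size beyond which the RBM can learn the archetypes, namely the machine can successfully act as a generative model or as a classifier, depending on the operational routine. In general, assessing a critical sample size (possibly in relation to the quality of the dataset) is still an open problem in machine learning. Here, restricting to the random theory, where shallow networks suffice and the grandmother-cell scenario is correct, we leverage the formal equivalence between RBMs and Hopfield networks to obtain a phase diagram for both neural architectures. The diagram highlights the regions, in the space of the control parameters (i.e., number of archetypes, number of neurons, size and quality of the training set), where learning can be accomplished. Our investigations are led by analytical methods based on the statistical mechanics of disordered systems, and the results are further corroborated by extensive Monte Carlo simulations.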
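The formal equivalence between RBMs and Hopfield networks mentioned in this abstract can be sketched with the standard Gaussian-integration argument. The abstract does not spell out the setting, so the derivation below assumes binary visible units $\sigma_i \in \{-1,+1\}$, $i = 1,\dots,N$, standard Gaussian hidden units $z_\mu$, $\mu = 1,\dots,P$, weights $\xi_i^\mu$ and inverse temperature $\beta$; the notation is illustrative and may differ from the paper's.

```latex
\begin{align}
Z_{\mathrm{RBM}}
  &= \sum_{\sigma} \int \prod_{\mu=1}^{P} \frac{dz_\mu}{\sqrt{2\pi}}\,
     \exp\!\Big( -\tfrac{1}{2}\sum_{\mu=1}^{P} z_\mu^2
       + \sqrt{\tfrac{\beta}{N}} \sum_{i=1}^{N}\sum_{\mu=1}^{P} \xi_i^\mu \sigma_i z_\mu \Big) \\
  &= \sum_{\sigma} \prod_{\mu=1}^{P}
     \exp\!\Big( \tfrac{\beta}{2N} \Big( \sum_{i=1}^{N} \xi_i^\mu \sigma_i \Big)^{2} \Big)
     \qquad \text{using } \int \frac{dz}{\sqrt{2\pi}}\, e^{-z^2/2 + a z} = e^{a^2/2} \\
  &= \sum_{\sigma} \exp\!\big( -\beta H_{\mathrm{Hopfield}}(\sigma) \big),
  \qquad
  H_{\mathrm{Hopfield}}(\sigma)
    = -\frac{1}{2N} \sum_{i,j=1}^{N} \sum_{\mu=1}^{P} \xi_i^\mu \xi_j^\mu \sigma_i \sigma_j .
\end{align}
```

Under these assumptions, marginalizing the hidden layer leaves a Hopfield model whose stored patterns are the RBM weights, which is why a phase diagram obtained for one architecture can be read for the other.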


A conditional, a fuzzy and a probabilistic interpretation of self-organising maps

Laura Giordano , Valentina Gliozzi , Daniele Theseider Dupré

Category: Artificial Intelligence

2021-03-11

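In this paper we establish a link between fuzzy and preferential semantics for description logics and self-organising maps, which have been proposed as possible candidates to explain the psychological mechanisms underlying category generalisation. In particular, we show that the input/output behaviour of a self-organising map after training can be described by a fuzzy description logic interpretation as well as by a preferential interpretation based on a concept-wise multipreference semantics, which takes into account preferences with respect to different concepts and has recently been proposed for ranked and for weighted defeasible description logics. Properties of the network can be proven by model checking on the fuzzy or on the preferential interpretation. Starting from the fuzzy interpretation, we also provide a probabilistic account of this neural network model.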
